High-throughput sequencing of complete human mtDNA genomes from the Philippines.

Authors

  • Ellen D Gunnarsdóttir
  • Mingkun Li
  • Marc Bauchet
  • Knut Finstermeier
  • Mark Stoneking
Abstract

Because of the time and cost associated with Sanger sequencing of complete human mtDNA genomes, practically all evolutionary studies have screened samples first to define haplogroups and then either selected a few samples from each haplogroup, or many samples from a particular haplogroup of interest, for complete mtDNA genome sequencing. Such biased sampling precludes many analyses of interest. Here, we used high-throughput sequencing platforms to generate, rapidly and inexpensively, 109 complete mtDNA genome sequences from random samples of individuals from three Filipino groups, including one Negrito group, the Mamanwa. We obtained on average ∼55-fold coverage per sequence, with <1% missing data per sequence. Various analyses attest to the accuracy of the sequences, including comparison to sequences of the first hypervariable segment of the control region generated by Sanger sequencing; patterns of nucleotide substitution and the distribution of polymorphic sites across the genome; and the observed haplogroups. Bayesian skyline plots of population size change through time indicate similar patterns for all three Filipino groups, but sharply contrast with such plots previously constructed from biased sampling of complete mtDNA genomes, as well as with an artificially constructed sample of sequences that mimics the biased sampling. Our results clearly demonstrate that the high-throughput sequencing platforms are the methodology of choice for generating complete mtDNA genome sequences.
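As a rough illustration of the two quality figures quoted in the abstract (mean fold coverage and the fraction of missing data per sequence), the following is a minimal sketch and not the authors' pipeline. The function name `coverage_summary`, the `min_depth` threshold, and the toy depth values are all assumptions made for this example; the abstract does not say how a site was judged to be missing.

```python
from typing import Sequence, Tuple


def coverage_summary(depths: Sequence[int], min_depth: int = 1) -> Tuple[float, float]:
    """Return (mean read depth, fraction of sites with depth below min_depth).

    `depths` holds one read-depth value per genome position.
    `min_depth` is a hypothetical cutoff below which a site counts as missing.
    """
    if not depths:
        raise ValueError("depths must not be empty")
    mean_depth = sum(depths) / len(depths)
    missing_fraction = sum(1 for d in depths if d < min_depth) / len(depths)
    return mean_depth, missing_fraction


# Toy data: ten positions, one of them with no reads at all (made-up values).
toy_depths = [60, 55, 0, 50, 58, 52, 61, 54, 57, 53]
mean_depth, missing_fraction = coverage_summary(toy_depths)
print(f"mean coverage: {mean_depth:.1f}x, missing: {missing_fraction:.1%}")
```

In practice the per-position depths would come from an alignment of the reads to a reference mtDNA genome, and the cutoff would be chosen to match the variant-calling criteria in use.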

Similar resources

Genetic Continuity of Anatomically Modern Human between India and Island Southeast Asia (ISEA): Last Glacial Dispersal of mtDNA Lineage

ABSTRACT: Our complete sequencing of 220 mtDNA genomes from the Savara and Porja of east coastal India reveals that about 25 per cent of the genomes belong to European macro haplogroup N. For the first time we identified mitochondrial DNA from one south Indian Savara individual that shares seven specific mutations with the N22 lineage observed in the Orang Asli group of Aboriginal Malaya, Cuyonin from Pal...

Effective Extraction and Assembly Methods for Simultaneously Obtaining Plastid and Mitochondrial Genomes

BACKGROUND In conventional approaches to plastid and mitochondrial genome sequencing, the sequencing steps are performed separately; thus, plastid DNA (ptDNA) and mitochondrial DNA (mtDNA) should be prepared independently. However, it is difficult to extract pure ptDNA and mtDNA from plant tissue. Following the development of high-throughput sequencing technology, many researchers have attempte...

Targeted high-throughput sequencing of tagged nucleic acid samples

High-throughput 454 DNA sequencing technology allows much faster and more cost-effective sequencing than traditional Sanger sequencing. However, the technology imposes inherent limitations on the number of samples that can be processed in parallel. Here we introduce parallel tagged sequencing (PTS), a simple, inexpensive and flexible barcoding technique that can be used for parallel sequencing ...

Analysis of the complete human mtDNA genome: methodology and inferences for human evolution.

The analysis of mitochondrial DNA (mtDNA) sequences has been a potent tool in our understanding of human evolution. However, almost all studies of human evolution based on mtDNA sequencing have focused on the control region, which constitutes less than 7% of the mitochondrial genome. The rapid development of technology for automated DNA sequencing has made it possible to study the complete mtDN...

False Negatives Are a Significant Feature of Next Generation Sequencing Callsets

Short-read, next-generation sequencing (NGS) is now broadly used to identify rare or de novo mutations in population samples and disease cohorts. However, NGS data is known to be error-prone and post-processing pipelines have primarily focused on the removal of spurious mutations or "false positives" for downstream genome datasets. Less attention has been paid to characterizing the fraction of ...

Journal title:
  • Genome Research

Volume 21, Issue 1

Pages: -

Publication date: 2011